
Fix bug in FsaFromTensor for empty FSA; make index select from ragged… #481

Merged
danpovey merged 1 commit into k2-fsa:master on Dec 9, 2020

Conversation

@danpovey
Collaborator

danpovey commented Dec 9, 2020

… print out error cause

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020

When using this to debug the determinization failure in snowfall, I see:

```
[F] /ceph-dan/k2/k2/python/csrc/torch/index_select.cu:at::Tensor k2::SimpleRaggedIndexSelect1D(at::Tensor, k2::Ragged<int>&) [with T = int]:208 There must be at most one non-zero element in src for any sub-list in indexes; sub-list 30861212 has too many elements: [ 4 200004 ]
```
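
For context, here is a minimal pure-Python sketch of the invariant this check enforces, with a list of lists standing in for the ragged `indexes`; this is illustrative only, not k2's implementation:

```python
# Sketch (not k2's code) of the SimpleRaggedIndexSelect1D invariant: for each
# sub-list of `indexes`, at most one selected entry of `src` may be non-zero;
# that entry (or zero) becomes the output value for the sub-list.
def simple_ragged_index_select(src, indexes):
    out = []
    for row in indexes:
        nonzero = [src[i] for i in row if src[i] != 0]
        if len(nonzero) > 1:
            # This is the case the log above reports, e.g. values [4, 200004].
            raise ValueError(f'too many non-zero elements: {nonzero}')
        out.append(nonzero[0] if nonzero else 0)
    return out

src = [0, 4, 0, 200004]  # e.g. aux_labels, one per input arc
print(simple_ragged_index_select(src, [[0, 2]]))  # -> [0]
try:
    simple_ragged_index_select(src, [[1, 3]])     # two non-zero values
except ValueError as e:
    print(e)
```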

danpovey merged commit ffc3590 into k2-fsa:master on Dec 9, 2020
@danpovey
Collaborator Author

danpovey commented Dec 9, 2020

I think we should be able to fix the determinization failure in snowfall by removing the disambiguation symbols before turning the ragged tensor of symbols into a linear one.

@qindazhu
Collaborator

qindazhu commented Dec 9, 2020

I didn't print the values before, but I did try removing the disambiguation symbols before doing determinize back then, and it still had the same issue. Anyway, let me try printing the values with your latest code.

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020 via email

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020 via email

@qindazhu
Collaborator

qindazhu commented Dec 9, 2020

Yeah, that's exactly what I did before (sorry for the wrong wording above): I saved aux_labels and labels before determinizing, removed the SimpleRaggedIndexSelect code in the Python determinize, and then after determinization called SimpleRaggedIndexSelect on the aux_labels saved from before determinization.

@qindazhu
Collaborator

qindazhu commented Dec 9, 2020

```
/ceph-hw/k2/k2/python/csrc/torch/index_select.cu:at::Tensor k2::SimpleRaggedIndexSelect1D(at::Tensor, k2::Ragged<int>&) [with T = int]:208 There must be at most one non-zero element in src for any sub-list in indexes; sub-list 30860097 has too many elements: [ 4 77469 ]
```

@danpovey, the above example is a failure (4 is the word A and 77469 is the word HARD) that I get with the code below:

```diff
diff --git a/k2/python/k2/fsa_algo.py b/k2/python/k2/fsa_algo.py
index 0c14d94..23e6449 100644
--- a/k2/python/k2/fsa_algo.py
+++ b/k2/python/k2/fsa_algo.py
@@ -316,21 +313,14 @@ def determinize(fsa: Fsa) -> Fsa:
         Otherwise, a new deterministic fsa is returned and the
         input ``fsa`` is NOT modified.
     '''
-    properties = getattr(fsa, 'properties', None)
-    if properties is not None \
-            and properties & fsa_properties.ARC_SORTED_AND_DETERMINISTIC != 0: # noqa
-        return fsa

     ragged_arc, arc_derivs = _k2.determinize(fsa.arcs)
-    aux_labels = None
-    if hasattr(fsa, 'aux_labels'):
-        aux_labels = _k2.simple_ragged_index_select(fsa.aux_labels, arc_derivs)
-    out_fsa = Fsa(ragged_arc, aux_labels)
+    out_fsa = Fsa(ragged_arc, aux_labels=None)

     for name, value in fsa.named_non_tensor_attr():
         setattr(out_fsa, name, value)

-    return out_fsa
+    return out_fsa, arc_derivs
diff --git a/snowfall/decoding/graph.py b/snowfall/decoding/graph.py
index d709e03..6d677a1 100644
--- a/snowfall/decoding/graph.py
+++ b/snowfall/decoding/graph.py

@@ -32,22 +28,27 @@ def compile_LG(
     """
     L_inv = k2.arc_sort(L.invert_())
     G = k2.arc_sort(G)
-    logging.debug("Intersecting L and G")
+    logging.info("Intersecting L and G")
     LG = k2.intersect(L_inv, G)
-    logging.debug(f'LG shape = {LG.shape}')
-    logging.debug("Connecting L*G")
+    logging.info(f'LG shape = {LG.shape}')
+    logging.info("Connecting L*G")
     LG = k2.connect(LG).invert_()
-    logging.debug(f'LG shape = {LG.shape}')
-    logging.debug("Determinizing L*G")
-    LG = k2.determinize(LG)
-    logging.debug(f'LG shape = {LG.shape}')
-    logging.debug("Connecting det(L*G)")
+    logging.info(f'LG shape = {LG.shape}')
+    aux_labels = LG.aux_labels
+    logging.info("Determinizing L*G")
+    LG, arc_derivs = k2.determinize(LG)
+    LG.labels[LG.labels >= labels_disambig_id_start] = 0
+    aux_labels[aux_labels >= aux_labels_disambig_id_start] = 0
+    logging.info('select aux_labels')
+    LG.aux_labels = k2.simple_ragged_index_select(aux_labels, arc_derivs)
+    logging.info(f'LG shape = {LG.shape}')
+    logging.info("Connecting det(L*G)")
     LG = k2.connect(LG)
     logging.debug(f'LG shape = {LG.shape}')
     logging.debug("Removing disambiguation symbols on L*G")
-    LG.labels[LG.labels >= labels_disambig_id_start] = 0
-    LG.aux_labels[LG.aux_labels >= aux_labels_disambig_id_start] = 0
     LG = k2.add_epsilon_self_loops(LG)
```

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020

I'd rather you call it arc_map rather than arc_derivs.
Not sure yet why this would happen; it feels like a bug.

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020

And please show me output with sizes, so I can compare with my setup.

@qindazhu
Collaborator

qindazhu commented Dec 9, 2020

What sizes do you need? The num_states of the determinized fsa?

@danpovey
Collaborator Author

danpovey commented Dec 9, 2020

OK, I did some debugging and it doesn't appear to be a bug. I was just wrong to think that this type of situation would not occur.

After thinking about various possible solutions, I think the best one is to 'cut the Gordian knot' and simply abandon the assumption that there will be at most one aux_label per arc. That is: to leave it as a Ragged tensor. When doing further operations like epsilon removal, we need to use the appropriate method of indexing a ragged tensor with a ragged tensor, which will append the indexed lists. (I'll have to check whether the operation already exists, but it's basically indexing the old ragged tensor with the values of the index, using ComposeRaggedShapes to compose the shapes, removing axis 1, and using that as the shape of the output ragged object.)
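
A pure-Python sketch of those semantics, with lists of lists standing in for the ragged tensors (the function name is made up for illustration):

```python
# Sketch of "indexing a ragged tensor with a ragged tensor": each row of
# `index` selects several rows of `src`, and the selected rows are appended
# to form one output row.  In k2 terms this would index src's values with
# index's values, compose the two shapes (ComposeRaggedShapes), and remove
# axis 1; here plain lists stand in for the ragged tensors.
def index_ragged_with_ragged(src, index):
    return [[v for i in row for v in src[i]] for row in index]

src = [[10], [], [20, 21]]   # ragged aux_labels: one sub-list per old arc
index = [[0, 2], [1]]        # ragged arc_map: one sub-list per new arc
print(index_ragged_with_ragged(src, index))  # [[10, 20, 21], []]
```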

I think we should probably create a RaggedTensor object that holds a RaggedShape and a Tensor, for these kinds of purposes.

@qindazhu
Collaborator

> indexing a ragged tensor with a ragged tensor

Ah, OK. It sounds like ComposeArcMaps? I thought we needed something like indexing an (aux) array with a ragged array (arc_map).

BTW, do you think we should do something like expanding the sequence of aux_labels from a Ragged to a 1-D Array for now, as we do in the host inversion?

@danpovey
Collaborator Author

danpovey commented Dec 10, 2020 via email

@qindazhu
Collaborator

I mean, for example, if one arc has two aux_labels, say [aux1, aux2], then we expand the arc into two arcs: one arc is (arc.src_state, temp_state, label=0, aux1, score=0), and the other is (temp_state, arc.dest_state, arc.label, aux2, arc.score). We expand all such arcs so that we get a new Fsa with only one aux_label per arc.
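
A small sketch of that expansion with plain tuples (the field order and the `new_state` helper are just for illustration):

```python
# Sketch: split an arc carrying several aux_labels into a chain of arcs
# carrying one aux_label each.  Arcs here are tuples
# (src_state, dest_state, label, aux_label, score); `new_state` allocates a
# fresh intermediate state.
def expand_arc(arc, aux_labels, new_state):
    src, dest, label, score = arc
    if not aux_labels:                  # no aux_label: keep the arc, aux 0
        return [(src, dest, label, 0, score)]
    arcs, cur = [], src
    for aux in aux_labels[:-1]:         # epsilon arcs carry all but the last
        nxt = new_state()
        arcs.append((cur, nxt, 0, aux, 0.0))
        cur = nxt
    # The original label and score stay on the final arc.
    arcs.append((cur, dest, label, aux_labels[-1], score))
    return arcs

# Example matching the description above: [aux1, aux2] -> two arcs.
counter = iter(range(100, 200))
print(expand_arc((0, 1, 5, -0.5), [7, 8], lambda: next(counter)))
# [(0, 100, 0, 7, 0.0), (100, 1, 5, 8, -0.5)]
```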

@danpovey
Collaborator Author

danpovey commented Dec 10, 2020 via email

@danpovey
Collaborator Author

How about we make it so that the tensor_attrs can actually contain not just Tensor but also RaggedInt? I think that might be the path of least resistance.
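
A hedged sketch of what the check might look like on the Python side; the function and the accessors are hypothetical, only the first-dimension/num-arcs invariant comes from this thread:

```python
import torch
import _k2  # k2's Python extension; RaggedInt is assumed exposed here

# Hypothetical sketch (not the actual Fsa code): accept either a torch.Tensor
# or a RaggedInt as a tensor attribute, checking that its first dimension
# equals the number of arcs.
def check_tensor_attr(value, num_arcs):
    if isinstance(value, torch.Tensor):
        assert value.shape[0] == num_arcs  # one entry per arc
    elif isinstance(value, _k2.RaggedInt):
        assert value.dim0() == num_arcs    # one sub-list per arc
    else:
        raise TypeError(f'unsupported attribute type: {type(value)}')
```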

@qindazhu
Collaborator

Agree, as RaggedInt has the same Dim0 as all the tensor_attrs in the fsa(vec), i.e. the number of arcs.

BTW, for the case where one arc has more than one aux_label, we need to keep the order of the arc-map values in each row of the RaggedInt; it seems we keep that in the host version:

```cpp
// The following is mostly for ease of interpretability of the output;
// conceptually the order makes no difference.
// TODO(dpovey): maybe remove this, for efficiency?
std::reverse(deriv_out->begin(), deriv_out->end());
```

Just mentioning this in case we forget it in the future, since for the non-top-sorted version the order may be non-ascending?

@danpovey
Collaborator Author

danpovey commented Dec 10, 2020 via email
